Efficient data mappings for parity-declustered data layouts

نویسندگان

Eric J. Schwabe

Ian M. Sutherland

چکیده

The joint demands of high performance and fault tolerance in a large array of disks can be satisfied by a parity-declustered data layout – an arrangement of data and redundant information that allows the rapid reconstruction of lost data while the array continues to operate. A data layout is typically generated by partitioning the data units on the disks into stripes and choosing one or more units per stripe to hold redundant information. Such a data layout can be represented as a table of stripes. The data mapping problem is the problem of translating a data address in a linear address space (the file system’s view) into a disk identifier and an offset on the disk where the data is stored. Typically, the disk and offset are obtained from the data layout using table lookups, but recent work has yielded mappings that compute (disk, offset) pairs directly from data addresses without the need to store tables. In this paper, we show that parity-declustered data layouts based on commutative rings yield mappings with improved computational efficiency. These layouts also apply to a wider range of array configurations than other known layouts that do not use table lookup.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Permutation Development Data Layout (PDDL)

Declustered data organizations in disk arrays (RAIDs) achieve less-intrusive reconstruction of data after a disk failure. We present PDDL, a new data layout for declustered disk arrays. PDDL layouts exist for a large variety of disk array configurations with a distributed spare disk. PDDL declustered disk arrays have excellent run-time performance under light and heavy workloads. PDDL maximizes...

متن کامل

Permutation Development Data Layout (PDDL) Disk Array Declustering

Declustered data organizations have been proposed to achieve less-intrusive reconstruction of a failed disk’s contents. In previous work, Holland and Gibson identified six desirable properties for ideal layouts. Ideal layouts exist for a very limited family of configurations. The PRIME data layout deviates from the stated ideal only slightly and its run-time performance is very good for light w...

متن کامل

An Example of Data Dependence Result for The Class of Almost Contraction Mappings

In the present paper, we show that $S^*$ iteration method can be used to approximate fixed point of almost contraction mappings. Furthermore, we prove that this iteration method is equivalent to CR iteration method and it produces a slow convergence rate compared to the CR iteration method for the class of almost contraction mappings. We also present table and graphic to support this result. F...

متن کامل

Compound Treatment of Chained Declustered Replicas Using a Parallel Btree for High Scalability and Availability

Scalability and availability are key features of parallel database systems. To realize scalability, many dynamic load-balancing methods with data placement and parallel index structures on sharednothing parallel infrastructure have been proposed. Data migration with range-partitioned placement using a parallel Btree is one solution. The combination of range partitioning and chained declustered ...

متن کامل

Spectral Characterization of Iterating Lossy Mappings

In this paper we study what happens to sets when we iteratively apply lossy (round) mappings to them. We describe the information loss as imbalances of parities of intermediate distributions and show that their evolution is governed by the correlation matrices of the mappings. At the macroscopic level we show that iterating lossy mappings results in an increase of a quantity we call total imbal...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Theor. Comput. Sci.

دوره 325 شماره

صفحات -

تاریخ انتشار 2004

Efficient data mappings for parity-declustered data layouts

نویسندگان

چکیده

منابع مشابه

Permutation Development Data Layout (PDDL)

Permutation Development Data Layout (PDDL) Disk Array Declustering

An Example of Data Dependence Result for The Class of Almost Contraction Mappings

Compound Treatment of Chained Declustered Replicas Using a Parallel Btree for High Scalability and Availability

Spectral Characterization of Iterating Lossy Mappings

عنوان ژورنال:

اشتراک گذاری